Some New Features for Protein Fold Prediction
نویسندگان
چکیده
In this paper we propose several sets of new features for protein fold prediction. The first feature set consisting of 47 features uses only the sequence information. We also define four different sets of features based on hydrophobicity of amino acids. Each such set has 400 features which are motivated by folding energy modeling. To define these features we have considered pair-wise amino acids (AA) interaction potential. The effectiveness of the proposed feature sets is tested using multilayer perceptron and radial basis function networks to solve the 4 class (level 1) and 27 class (level 2) prediction problems as defined in the context of SCOP classification. Our investigation shows that such features have good discriminating powers in predicting protein folds.
منابع مشابه
Prediction of Protein Sub-Mitochondria Locations Using Protein Interaction Networks
Background: Prediction of the protein localization is among the most important issues in the bioinformatics that is used for the prediction of the proteins in the cells and organelles such as mitochondria. In this study, several machine learning algorithms are applied for the prediction of the intracellular protein locations. These algorithms use the features extracted from pro...
متن کاملFrom fold predictions to function predictions: automation of functional site conservation analysis for functional genome predictions.
A database of functional sites for proteins with known structures, SITE, is constructed and used in conjunction with a simple pattern matching program SiteMatch to evaluate possible function conservation in a recently constructed database of fold predictions for Escherichia coli proteins (Rychlewski L et al., 1999, Protein Sci 8:614-624). In this and other prediction databases, fold predictions...
متن کاملAdvanced quantitative MRI radiomics features for recurrence prediction in glioblastoma multiform patients
Introduction: Advanced quantitative information such as radiomics features derived from magnetic resonance (MR) image may be useful for outcome prediction, prognostic models or response biomarkers in Glioblastoma (GBM). The main aim of this study was to evaluate MRI radiomics features for recurrence prediction in glioblastoma multiform. Materials and Methods:</str...
متن کاملProtein Fold Recognition Using an Overlapping Segmentation Approach and a Mixture of Feature Extraction Models
Protein Fold Recognition (PFR) is considered as a critical step towards the protein structure prediction problem. PFR has also a profound impact on protein function determination and drug design. Despite all the enhancements achieved by using pattern recognitionbased approaches in the protein fold recognition, it still remains unsolved and its prediction accuracy remains limited. In this study,...
متن کاملFold and function predictions for Mycoplasma genitalium proteins.
BACKGROUND Uncharacterized proteins from newly sequenced genomes provide perfect targets for fold and function prediction. RESULTS For 38% of the entire genome of Mycoplasma genitalium, sequence similarity to a protein with a known structure can be recognized using a new sequence alignment algorithm. When comparing genomes of M. genitalium and Escherichia coli, > 80% of M. genitalium proteins...
متن کامل